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ABSTRACT 

The uses of simulation as a technique for instruction and 
research have been expanding over the past ten years. As an aid 
to educational practitioners, the sampling procedures associated 
with trivariate normal populations are described along with a des- 
cription of an APL/1500 computer program for this level of multi- 
variate sampling in educational research. A sample use of the pro- 
gram is also given. 
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AN APL/1500 PROCEDURE FOR GENERATING RANDOM SAMPLES 
FROM TRIVARIATE NORMAL POPULATIONS 

David B . Thomas 

Introduction 

The uses of simulation as a technique for instruction and 
research have been expanding over the past ten years. Simulation 
of psychological theories and the development of "learning machines" 
have been described by Feigenbaum and Feldman (1963) and others 
(Green, 1969; Snyder, 1968). Instructional simulations have been 
suggested as supplements to expository instructional techniques 
(Raser, 1969). In these instructional applications a formal process 
from physics, chemistry, or mathematics is modeled and presented to 
the student . as an exercise at a cathode ray tube terminal (Kromhout, 
Hansen, & Schv-^arz, 1970). Social science instructional applications 
are described by Boocock and Schild (1968) and by Raser (1969). In 
these uses, the social science process may be modeled and presented as 
a learning game in which the student may act out various roles according 
to the rules of the game. The use of simulation in teacher training 
programs is described by Smith and Smith (1966). 

At the Florida State University Center for Computer-Assisted 
Instruction, both the Coursewriter II (IBM, 1968) and APL (Iverson, 1966) 
programming languages are employed for research in instruction. The 
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FSU-APL system is implemented on an IBM 1800-based 1500 system which 
supports both cathode ray tube and teletype-typewriter terminals. The 
APL/1500 system (Krueger and McMurchie, 1968) permits easy implementation 
of simulation models based on complex mathematical algorithms. When 
coupled with the interactive nature of APL , the system provides a simu- 
lation capability which may be utilized both for research on instruction 
and as a means of instruction in mathematics, physics, chemistry, library 
science, and the social sciences. 

An investigation of an innovative teacher training program, 
with one purpose being the implementation and simulation of a training 
model on a computer system, is described by King (1970). In this investi- 
gation, a methodology was required whereby a normal distribution with 
a specified mean and variance could be randomly sampled. A method 
was available which allowed for sampling from these populations. The 
APL function available employed the method reported by Muller (1959), 
modified so as to reduce the amount of computer space necessary for 
execution. Using the function, independent samples could easily be 
drawn from any normal population with a user specified mean and variance. 
This APL program is reported by Lippert, et al . (1970) as FSU APL/1500 
function FSNU12. 

The APL function (FSNU12) was adequate for drawing a series of 
independent normally distributed random variables. However, as is the 
case with much research in education and psychology, multiple measure- 
ments on the same experimental unit or person were desired. These measure 
ments, rather than being independent, are correlated. In order to 
simulate phenomena in which multiple measuv^ements on the same persons 
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were present, a method was required which would sample from a specified 
multivariate normal population. In such a population, persons are 
represented as points in an n-dimensional space, wherein each of the 
variables is considered a dimension. (A multivariate normal population 
was desired because the majority of statistical measures in education 
and psychology require the assumption of normality (Morrison, 1967))l 

In the multivariate normal population the random variables 
are related to one another through the variance-covariance matrix 
This matrix, in the case of standardized variables, specifies the inter- 
correlations among the variables. By specifying one can then 
interrelate the variables (i.e., the measures on persons) as desired. 

The simulation of the teacher training model used by King 
(1970) required three measures on eadi person, so that it was necessary 
to draw random samples from a trivariate normal population. Tocher 
(1963) presented a method for selecting samples from the bivariate 
normal population when independently distributed N(0,l) random variables 
were available. The remainder of this paper describes the rationale 
for extending Tocher's method, and presents the equations used in the 
APL/1500 functions to sample from the trivariate normal populations. 
Examples of the function's "se are also given. 
Sampling Procedure 

Suppose that the trivariate normal population of interest has 
the following variance-covariance matrix, z, and mean vector y 



12 13 

'21 1 'Z3 

/31 ^32 1 _ 

[0 0 0] 
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in which the p.. are the population values of the correlations between 
variables i and j. If Y^^, Yg, and Y^ are three normally and independently 

distributed random variables , each N(0, 1), then the trivariate normal 
random variables , X^, X^ may be represented as f;inctions of the 
Y^ as fol lows : 

(1) X^=Y^ 

(2) = a^Y^ + 

,(3) X3 = a3Y^ + a^Y^ + ^ 

The task is to compute the constants a^. such that a specific 
variance-covariance matrix, z, of the variables X^ X2 > X3 results. 

For any two normally and independently distributed random vari- 
ables Y. and Y., it is known (Anderson, 1958; Hogg and Craig, 1970) that 

(4) E[Y.-Y,-1 0- 

For correlated standardized random variables Y. and Y- 

(5) EEYiYj] = 

From the requirement that the Y's be distributed normally with mean zero 
and variance 1, it is know that 

(6) ELY?] = i 

From these properties, the values of the weights a^. can be 
derived as follows: 

p]2 = E[XiX2:J from (5) 

E[Y^(a^Y^+ a2Y2)] from (1) and (2) 



So that 



(7) = ^1 by expansion and from 

^ (4) and (6) 
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Now to derive 



1 = E[Xp 



from (6) 
from (2) 



and in a similar manner, we show: 

(8) . 

To derive 33, we set: 
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^13 = ECX-iXg] f'^O"! (5) 

" £[^1(^3^1 + ^4^2 + ^5^3)] fi'om (1) and (3) 



from whi ch 
(9) 

To derive a^, set 



Pl3 = *3 



^23 " ^"^^2^3^ 



from (5) 



= E[(a^Y^ + a^Yg) (a3Y^ + a^Y^ + a^Yg)], 

from (2) and (3) 

and by procedures similar to those above, we find 



from whi ch 
(10) 



= a^a3 + a^a^ 



Pl2Pi3 + H 



'4 =^-^12^13 



1-P 



12 



from (7), (8) , and (9) 



UP 
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finally, to computer a^: 



1 = ECX3] from (6) 



from (3) 
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by expanding and substituting we can show that 



from which 

(11) 

To summarize, 
(7) 

(8) 

(9) 

(10) 



(11) 



1 



Pl3 (P23-P12 P13)' + 



1-P 



12 



l-P^.-( 



13"^P23'Pl2''23J 



1-P 



12 



'12 



1-P 
^13 

'23-^12^3 



1-P 



12 



\l 



'■^3-^^23-^12^23^ 



1-P 



12 



From the foregoing, the following restrictions emerge: 

2 / 

(1) (p^2 1) ""^ order that may be computed (has finite bounds) 

2 2 2 

(2) p^2"^ ^13 "^^23 ^1 + 2 P12P13P23 order that 

remain real ) . 

b 

The X^. may now be computed from the already selected . The 
X^. will then constitute a random sample from the trivariate normal 
population with the variance-covariance matrix i: and m.ean vector y. 
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API 1500 Functions 

The function M employs a modification of the Box-MuHer method 
for generating standardized normal random deviates (see Muller, 1959). 
These are then transformed into random variables from a normal distri- 

2 

bution with given mean y cind variance a before being output. See Lippert, 
et al . (1970) for a description of the use of this function independently 
of the current application (FSNUU). A listing of the function M follows. 



V/VCL3V 

[1] R^ilHu^lL ;1 ] )t(//p/7C ;2]>t0. 5 )x (loo2x(?/^p/? ) v/?)x(2x©i?v?(//^/i^, Ipp^ 
)pi?^8388G08 )*0 . 5 
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The function X_ computes the constants a^ and derives the " 
trivari^te sample points X^, by employing the function M to 

generate independent N(0,1) variables which are manipulated according to 
the procedure given in equations (l)-(3) from the previous section. The 
explicit result of is an r x 3 matrix of data in which each row 
corresponds to the standardized coordinates of the sample point in three 
dimensional space; r sample points are generated. Other straightforward 
measures are required to transform these coordinates into data with 
given means and variances (i.e., these data are standardized normal 
variates). A listing of this function follows: 

v^C L]v 
V R^-n X C \AiB;D 
[1] ( t /C*2 )>l + 2xx/C' )pG 

[2 J /l^CC 1] .Z?.CC2] ( 1-(C'C2]*2 )t(Z^^(CC3]-x /CCI 2] ) ( 1 ]* 

2 )*0 . 5 )*2 )*0 . 5 
[3] /?C;3]^(/?W/ 1/ 3 2 p 0 l)t.x""3f/l 
[H] /?[ ;2]^/?L ; '1 2]t.x2f;l 
[5] '^0 

[63 R^xQ ,OqI^' CORRELATIONS OUTSIDE DOMAIN OF FUNCTION.' 
V 
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Example Use 

Suppose that a sample of size 20 drawn from a trivariate population 
was required with the following covariance matrix: 



1.0 


.5 


.1 


.5 


1 .0 


-.6 


.1 


-.6 


1.0 



The following APL expression would generate the required sample: 

20 K .5 .1 ".5 

Sample data are given below: 



0.7454 


2 


,019 


0 


.1944 


1 .796 


0.8288 


0 


.01032 


0.943 


0 


.471 


1 


.353 


0.4799 


0 


.4117 


"l 


.041 


1 .236 


0 


.441 


0 


.5187 


1.23 


0 


.3218 


0 


.3429 


0.3484 


1 


.037 


0 


.521 


0.8126 


1 


.79 


1 


.117 


1.023 


0 


.004035 


0 


.2626 


0 .7774 


1 


.033 


0 


.02037 


0 .49^8 


0 


.01833 


0 


.2659 


1 .648 


1 


.411 


0 


.4178 


0.1092 


"0 


.2843 


0 


.3142 


0.1421 


0 


.1251 


0 


.3559 


1 . 118 


0 


.05234 


1 


.563 


0.9683 


• 0 


.5068 


0 


.2334 


0.2228 


1 


.654 


2 


.582 


0. 5176 


0 


.9099 


0 


,4405 


0.09321 


0 


.05726 


0 


.464 


0.7093 


1 


.898 


1 


.597 



The sample correlation matrix for these data is 
R 



1 .00 
.47 
.09 



.47 
1 .00 
-.62 



.09 
-.62 
1 .00 



These values are reasonable given the size of the sample and the usual 
sampling errors , 
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It is hoped that these procedures and their logical extensions 
will prove useful to behavioral scientists engaged in training research 
using simulation as a tool. At the present time, educational usages 
of this nature are limited. These functions were employed in an investi- 
gation of a teacher training model (King, 1970). Hopefully, publication 
of these functions will stimulate additional applications of the technique. 
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